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(54) Apparatus and method for pattern recognition 



(57) A method and an apparatus for pattern recog- 
nition are described. Features characterizing at least 
two significant portions or regions of the printed pattern 
on a model document are extracted from a scanned-in 
image of the model document. Statistical characteristics 
of each significant region are calculated from the 
extracted features and saved in a memory. In addition, 
geometrical relationships, e.g. distances and angles, 
between the regions are also saved in a memory. The 
geometrical relationships are represented by the coordi- 
nates of the regions in a predetermined coordinate sys- 
tem, e.g. the x-y coordinate system corresponding to 
the scan area on the platen of a scanner. When a new 
or trial document is scanned and examined through a 



search window, features are extracted from the search 
window blocks and compared to the statistical charac- 
teristics of the model document previously stored in 
memory. If there is a match, the coordinates of the 
search window having the matching features in the trial 
document are recorded in a candidate list. After me 
entire trial document has been processed, the geomet- 
ric relationships between the window blocks on the can- 
didate list are compared with the geometrical 
relationships of the matching significant region blocks of 
the model document and if the relationships correspond 
the trial document is considered a match. 
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Description 

BACKGROUND OF THE INVENTION 
Field of the Invention 

[0001] This invention relates generally to image 
processing machines, for example copying machines or 
scanner/fc>rinters, and more particularly to high-end 
color copiers and scanner/printers with features and 
capabilities enabling forgery of currency, securities, 
negotiable instruments, etc. The invention, is further 
related to apparatus and methods for the prevention of 
such forgery on copying machines and the like but is 
also applicable to any image processing device that 
scans a document and has the capability to compare a 
scanned image to a stored image, especially for the pur- 
poses of authentication of the document or prevention 
of its replication. 

Description of the Related Art 

[0002] A number of systems have been proposed for 
recognizing valuable documents both for verification of 
authenticity and for prevention of forgery These gener- 
ally involve storing an accurate representation of a gen- 
uine document, scanning a new document and 
comparing the scanned document to the stored repre- 
sentation. In vending machines and money changing 
machines, for example, the desire is to prevent the 
acceptance of counterfeit currency. Bills submitted into 
the machines are scanned and compared to one or 
more acceptable currency patterns. If the scanned pat- 
tern matches a pattern of one of the genuine bills with a 
predetermined degree of similarity then the machine 
accepts the bill and delivers the change or merchan- 
dise. One such currency identification system is dis- 
closed in U.S. Patent No. 4;386,432, issued to 
Nakamura et al. Typical of such systems, Nakamura 
conveys the currency note or bill at a fixed speed 
through a fixed scanning area and detects a character- 
istic of the bill. In this case it is the presence of a metallic 
element contained in the printing ink forming the print- 
ing pattern on the bill. Such systems typically require 
that the entire bill be scanned in a predetermined direc- 
tion and a predetermined orientation. While such 
requirements may not be an important limitation in 
vending machines or change machines where the user 
must insert the bill into a fixed dimension slot in a fixed 
orientation, such requirements make the identification 
system inoperative where the scanning orientation is 
not fixed, e.g. on a copier, where the user can place the 
bill in a variety of positions on the platen glass. 
[0003] Alternative systems, have been proposed that 
do not rely as much on a specific orientation of the doc- 
ument on the scanning support surface. Instead these 
systems attempt to recognize a particular aspect of the 
document that uniquely identifies it as one that should 
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not be duplicated. U.S. Patent Nos. 5,440,409 and 
5,481,378, issued to Sugano et al. detect a particular 
color of the background of confidential documents. For 
example if documents are not intended to be copied the 

s original document is printed on a red background. The 
document scanning section of the copier is provided 
with a color detecting feature which filters out all but a 
particular color on the scanned document. It then com- 
pares the density of the color signal to a pre-stored color 

10 table to determine if it matches the background color of 
documents that are not to be copied. If a match is found 
copying is prevented. Such systems have limited appli- 
cation to documents in which a particular color can be 
chosen as a background color and is not applicable to 

75 situations where the particular color may appear on 
documents that can legitimately be copied. For exam- 
ple, the green on a U.S. bill may appear on any number 
of documents that can be legitimately copied. 
[0004] U.S. Patent Nos. 5,379,093 and 5,515,451 

20 issued to Hashimoto et al. and Tsuji et al., respectively, 
seek to detect a combination of characteristics of one 
particular feature of the document to be copied. In these 
patents, first a particular color is detected. Then, the 
image pattern having the particular color is compared to 

25 pre-stored standard patterns to detect a certain degree 
of similarity. Then, the pattern is analyzed to determine 
if it conforms to a particular shape, i.e. a circle. These 
systems have particular utility in detecting one unique 
aspect of a document, i.e. the stamp mark on bank 

30 notes issued by the Bank of Japan. 

[0005] Additional systems have been proposed which 
seek to address the problem of document orientation on 
the platen glass. U.S. Patent No. 5,216.724 , issued to 
Suzuki et al., and U.S. Patent No. 5,533,144, issued to 

35 Fan, are typical of such systems. In Suzuki, for example, 
the document can be placed in any orientation on the 
platen. The document is scanned and the scanned 
images are processed such that the four corners of the 
document are determined and then the angle of the 

40 document on the platen is calculated. Then, one partic- 
ular feature of the document, e.g. the red stamp mark 
on a Japanese bank note, is compared to a pre-stored 
standard pattern which corresponds generally to the 
calculated angle of the document. A number of standard 

45 patterns at various orientation angles are pre-stored. 
Fan proposes a somewhat different system in which the 
orientation of the document is first calculated and then 
the pre-stored patterns or templates are rotated to 
match the angle of the scanned document. In both of 

so these systems the orientation of the document must first 
be calculated, which requires the detection of edges 
and corners. 

Objects of the Invention 

55 

[0006] Therefore, it is an object of the present inven- 
tion to overcome disadvantages of conventional sys- 
tems for recognizing original documents or patterns 
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thereon. 

[0007] An object is to provide an improved system for 
detecting an original document with a high degree of 
certainty but without a commensurately high amount of 
processing. s 
[0008] Another object is to provide an improved sys- 
tem for detecting an original document that is independ- 
ent of the orientation of the document when it is being 
scanned. 

10 

Summary of the Invention 

[0009] The above objecs are solved according to the 
present invention by an apparatus, method, medium 
and system as defined in the dependent claims. 75 
[0010] According to the present invention, features 
characterizing at least two significant portions or 
regions of the printed pattern on a model document are 
extracted. Statistical characteristics of each significant 
region are calculated from the extracted features and 20 
saved in a memory. In addition, geometrical relation- 
ships, e.g. distances and angles, between the regions 
are also saved in a memory. The geometrical relation- 
ships are represented by the coordinates of the regions 
in a predetermined coordinate system, i.e. the x-y coor- 25 
dinate system corresponding to the scan area on the 
platen. When a new or trial document is scanned and 
examined through a search window, features are 
extracted from the search window blocks and compared 
to the statistical characteristics of the model document 30 
previously stored in memory. If there is a match, the 
coordinates of the search window having the matching 
features in the trial document are recorded in a candi- 
date list. After the entire trial document has been proc- 
essed, the geometric relationships between the window 35 
blocks on the candidate list are compared with the geo- 
metrical relationships of the matching significant region 
blocks of the model document and if the relationships 
correspond the trial document is considered a match. In 
a color copier, for example, this positive comparison 40 
would be used to prevent the trial document from being 
copied and in a vending machine, for example, this pos- 
itive comparison would be used to allow delivery of mer- 
chandise. The present invention is operative regardless 
of the location of the document on the scanning support 45 
surface (transtational invariant) and regardless of the 
angular orientation (rotational invariant). Further, the 
location and angular orientation of the trial document 
need not be determined in order to compare it to the 
model document. so 
[0011] In the present invention, accuracy of the sys- 
tem is improved by first scanning in several samples of 
the model document, e.g. 30 samples of currency. This 
helps to account for variations in genuine documents, 
e.g. old, worn, dirty or wrinkled bills. For each sample, ss 
significant regions are selected and cropped out for fur- 
ther processing. In the example of currency, significant 
regions may include numbers, portrait, official seal, etc. 



on the bill. In the present invention the entire model doc- 
ument is not used for comparison, which improves effi- 
ciency and reduces sensitivity to noise. For each 
scanned-in pattern comprising the cropped -out signifi- 
cant regions, the present invention transforms the pat- 
tern to create a number of additional exemplars or 
virtual training patterns. For example, up to 200 virtual 
training patterns can be generated from one original 
pattern by applying various affine transformations (e.g. 
rotation, shifting). Then, features of each significant 
region are extracted out of all the training patterns, and 
the statistical characteristics (e.g. measures of disper- 
sion, such as means and variances) of the features are 
computed for all training patterns. The computed statis- 
tical characteristics are then stored in a memory along 
with the coordinates of each significant region which the 
statistical characteristics identify. This aspect of the 
invention is the registration mode which generates the 
memory storage or database which will be used for 
comparison with trial documents, e.g. bills or certificates 
introduced for copying. 

[0012] The second mode of the invention is the detec- 
tion mode which is activated each time a trial document 
is scanned in. A two-dimensional window search is per- 
formed on the low-resolution preview image (e.g. 50 or 
90 dots-per-inch, hereafter dpi) of the scanned docu- 
ment, which has a sufficient resolution for detection and 
comparison purposes. The search window size is the 
same as the block size for the significant regions 
selected in the registration mode. At each searching 
step, features are extracted from the searching window 
by the same feature extraction procedure used in the 
registration mode. Similarity scores are calculated by 
comparing the features extracted from the trial docu- 
ment to the statistical characteristics of each significant 
region stored in the registration mode. If the similarity 
score exceeds a predetermined threshold the coordi- 
nates of the current location of the searching window 
are recorded into a candidate list. Once the window 
search is completed, the geometrical relationships of 
the image blocks in the candidate list are determined. If 
the relationship is the same as the relationship of the 
corresponding significant region image blocks, the trial 
document is deemed to match the model. In a color cop- 
ier, for example, a match could be used to prevent a 
copy of the trial document from printing or could cause 
a pre-set pattern to be printed such as an all black 
image or a printed warning against unauthorized copy- 
ing. In a vending machine, for example, a match would 
enable merchandise to be dispensed. 
[001 3] Other objects and attainments together with a 
fuller understanding of the invention will become appar- 
ent and appreciated by referring to the following 
description and claims taken in conjunction with the 
accompanying drawings. 
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Brief Description of the Drawings 

[0014] In the drawings wherein like reference symbols 
refer to like parts 

Figs. 1 A through 1E are block diagram representa- 
tions of various general configurations of the envi- 
ronment of the present invention. 
Fig. 2 is a schematic block diagram of the major 
functional components of the present invention. 
Fig. 3 illustrates a document placed in a Cartesian 
coordinate scanning space S of the present inven- 
tion. 

Fig. 4 is a flowchart of the registration mode of the 
present invention. 

Fig. 5 is a flowchart of the feature extraction proc- 
ess of the present invention. 

Fig. 6 is a block diagram illustration of the feature 

extraction process of the present invention. 

Figs. 7A and 7B are flowcharts of the detection 

mode of the present invention. 

Fig. 8 illustrates the relationship between a model 

document having two significant regions and a test 

document with two matching feature areas. 

Fig. 9 illustrates the relationship between a model 

document having three significant regions and a 

test document with three matching feature areas 

Description of the Preferred Embodiments 

[001 5] Reference is now made to Figs. 1 A through 1 E 
which show the general configuration of an image han- 
dling unit 10 which could be part of a color copier, scan- 
ner/printer, vending machine, etc. As shown in Fig. 1 A, 
imaging handling unit 10 has three major components, 
a scanner 12 for reading in the pattern of a document, 
an image processing unit 1 4 for processing the scanned 
image and, in the present invention, for comparing the 
scanned document with a pre-stored model document, 
and an output device such as a printer 16 or a merchan- 
dise unit 18 (e.g. as in a vending machine). The scanner 
can take various forms such as a conventional light 
emitting scanner 12 found in copiers (Fig. 1B) or may 
take the form of a digital camera 20 directly connected 
to a printer 16 (Fig. 1E). 

[001 6] The image processor 1 4 may be implemented 
in hardware with discrete components, software, 
firmware, application specific integrated circuits 
(ASICs), or any combination thereof. Also, the func- 
tional blocks of the image processor are divided in this 
specification for convenience of description only. The 
functional and physical boundaries of these blocks will 
vary from device to device. For example, Fig. 1 B shows 
the image processor physically integrated with the scan- 
ner and printer units in a single housing of a color cop- 
ier. Portions of the image processor may be associated 
functionally more with the scanner unit than with the 
printer unit or vice versa. Fig. 1C shows an embodiment 
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of a scanner/printer device with the image processor 
formed as part of a personal computer (PC) 22 which 
may control operation of and communication between 
the scanner, printer and image processing unit, and 
5 control of and communication with peripheral equip- 
ment such as I/O device 24, each connected directiy or 
indirectly to a PC Bus 23. In this embodiment, the 
"scanned image" may be have been previously stored 
(and perhaps enhanced through processing) in an I/O 
70 device 24 and can be loaded into the PC through I/O 
interface 26, or the image may be captured with a digital 
image input device such as a digital camera 20. In addi- 
tion, the information processing unit 14, in the form of 
software, may be loaded into the PC's memory from an 
rs external storage device, i.e. I/O device 24. Alternately, 
the information processor in the form of hardware, 
ASIC, firmware, etc. or combination thereof can be 
embodied on an option card 28 that can be inserted into 
an available PC card slot. Fig. 1 D shows an embodi- 
20 ment in which the image processing unit 1 4 is physically 
part of the scanner 12, while Fig. 1E shows the image 
processor as a physical part of the printer 1 6. In Fig. 1 E, 
the image is input with a digital camera 20 rather than a 
conventional scanner 12. 
25 [001 7J While the present invention is applicable to any 
such device having these basic components, for the 
sake of illustration only the invention will be described in 
the environment of a particular image handling unit 
such as color copier or scanner/printer shown in Fig.2. 
30 Image handling unit 10 includes a clear, transparent 
support 30 (e.g. glass platen) on which a document 32 
is placed and protected by cover 34, which is generally 
solid white on the side facing the platen 30. Document 
32 can of course be any type, such as currency (bill or 
35 bank note), stock certificate, bond or any paper, but 
generally one that has value and is prohibited from 
unauthorized copying. A lamp 36 illuminates document 
32 with a white light, generally, and the reflected light is 
directed by mirrors 38. The reflected light is focused by 
40 lens 40 onto sensors 42 that deliver the color image 
information in the form of electrical signals to signal 
processing unit 44 of image processing unit 14. 
[0018] The image information is typically divided into 
sets of channels or color spaces, such as RGB or CIE- 
45 LAB. In RGB color space, three channels, R, G and B, 
carry red, green and blue signals, respectively. In CIE- 
LAB space, the L channel represents luminance infor- 
mation and the a and b channels represent 
chrominance information.. Signal processing unit 44 
so will, in part, convert continuous tone images into binary 
format (i.e. a halftoning process) for printer 16. Signal 
processing of image data is well known in the art and a 
detailed explanation of signal processing unit 44 is not 
necessary for an understanding or appreciation of the 
55 present invention. The scanned in image information is 
stored in an image buffer 46, which may be any suitable 
memory or an assigned area of a memory, e.g. a ran- 
dom access memory (RAM). 
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[0019] In the registration mode of the present inven- 
tion, the scanned in image information is divided into 
image blocks and certain blocks are cropped out from 
the entire image by significant region extractor 48 of 
image processing unit 14. As an example, consider doc- 
ument 32 as a U.S. dollar bill. Significant regions for 
comparison might include the denomination section in 
the lower left corner, the portrait in the center and the 
green Department of Treasury seal in the right center. 
Selection of significant regions (step S2 in Fig. 4) and 
selection of the block size of the regions depends on the 
application of the present invention, i.e. the type of doc- 
ument being considered. The coordinates of the signifi- 
cant regions selected are pre-stored in significant 
region coordinate memory 58 (step S4). It is to be noted 
that the various memory storage blocks, i.e. image 
buffer 46, significant region buffer 52, significant region 
coordinate memory 58, etc., maybe implemented in dif- 
ferent physical memories and different types of memo- 
ries, e.g. RAM, disc, or may be simply different 
addressable areas of one physical memory. 
[0020] In the registration mode, the document is 
placed on the platen in a predetermined position. For 
example, with reference to Fig.3, the scanning space of 
the platen 30 is shown as plane S with an x-y Cartesian 
system having the 0.0 point of the system in the upper 
left hand corner of the platen. With the document posi- 
tioned as shown and scanned in, blocks of image data 
can be cropped out according to their coordinates. For 
example, the blocks to be cropped could include one 
having a center position in the upper left region R1 cor- 
responding to the denomination on a bill, a second hav- 
ing a center position in the center region R2 
corresponding to the portrait on the bill, and a third block 
having a center position in the lower center region R3 
corresponding to the green treasury seal. 
[0021 ] The significant region extractor 48 is shown in 
greater detail in Fig.4. Significant region extractor 48 
can be embodied in software, hardware, firmware, ASIC 
or combination thereof. A new document (e.g. an origi- 
nal bill) is placed on the platen in the 0,0 position at step 
S6. The entire document is scanned at step S8 and the 
image data is stored in an image buffer 46. At step S10, 
the data in the image buffer is examined by moving an 
extraction window through the image data in a 2-dimen- 
sional search (in effect, across and down the docu- 
ment). The size of the extraction window is the same as 
the significant region block size. When the scan window 
location coincides with the predetermined coordinates 
of a significant region at step S12, that block of image 
data is stored with a coordinate identifier in significant 
region buffer 52 at step S14. This process continues 
until the document examination is completed at step 
S16. A new document (e.g. another bill of the same 
denomination) is then placed on the platen and the 
process is repeated until a sufficient number of samples 
have been input to account for typical variations in orig- 
inal documents, e.g. worn or stained bills. For example. 



30 currency samples can provide a reliable test base. 
Once significant regions have been extracted for a 
number of samples (step S18), additional training pat- 
terns are created by applying various aff ine transforma- 

5 tions (step S20) to the image data blocks by affine 
transformer 50. Affine transformer 50 .translates the 
image data blocks to all posstole locations in the scan-in 
image space S and rotates the image blocks through all 
rotation angles, and the resultant transformations are 

10 stored along with the original image blocks as training 
patterns in significant region buffer 52 (step S22). Up to 
200 training patterns, for example, can be generated by 
applying translational and rotational affine transforma- 
tions to one original pattern. Affine transformer 50, 

15 which can be embodied in hardware, software, 
firmware, ASIC, or combinations thereof, is a well 
known feature in the image processing art and a 
detailed description is not necessary for an understand- 
ing or appreciation of the present invention: 

20 [0022] Once the training patterns have been estab- 
lished, image features for each significant region of all 
training patterns are extracted by feature extractor 54, 
the operation of which will be described with reference 
to Figs. 5 and 6. First, a gray scale image sub-block (i.e. 

25 feature) is selected from a selected significant region 
image block (step S24) stored in significant region 
buffer 52. This feature sub-block is subjected to histo- 
gram equalization at step S26. The intensity of any par- 
ticular point in the image sub-block is adjusted 

30 according to a histogram of the feature sub-block. Histo- 
gram equalization redistributes the pixel gray levels so 
that the dynamic range of a pixel is increased. This can 
help mitigate the effect of wear or staining of bills. Next, 
at step S28, the feature sub-block is subjected to x and 

35 y directional projection. This step forms a feature vector 
Z for the sub-block by accumulating the intensity values 
of all pixels in N consecutive columns (x 1f x 2 . ...x N ) and 
in N consecutive rows (y 1p y 2 . ... yi\i) ° n the histogram 
equalized image block. This extraction method is illus- 

40 trated in Fig. 6. The value x 1( for example, in feature 
vector Z is the accumulated intensity value of all N pix- 
els of column 1 in the N x N pixel block. Given an image 
block of N x N pixels, the feature vector size is 2N. Fea- 
ture vector Z = (X,Y) = (x^, x 2 ,y 2 ■ -x^y^ = fa. *2. 

45 ...z 2N ). This projection method reduces the feature vec- 
tor size and makes the feature vector more shift-invari- 
ant. 

[0023] Next, statistical analysis is performed on the 
extracted feature vectors for each significant region at 

so (step S30) to yield a mean vector M = (m 1p nr^ ... m^) 
and a variance vector V = (v 1( v 2 ... v 2N ) which together 
characterize the significant region. This analysis 
involves storing the first extracted feature vector in 
model document database 56. Then the next feature is 

55 extracted and statistical analysis is performed by vector 
statistical analyzer 55 with reference to the first and sec- 
ond feature vectors to yield interim statistical character- 
istics, which are stored in database 56. Then a third 
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feature is extracted and statistical analysis is performed 
with reference to the third feature vector and the stored 
interim statistical characteristics, and so on until all fea- 
tures have been extracted and final statistical character- 
istics have been generated for the region. These 
statistical characteristics for each significant region are 
stored in a memory (i.e. model document database 56) 
along with the corresponding coordinates of each 
respective region. This model document database will 
thus comprise the images of the documents, e.g. cur- 
rency, to be recognized. As described, only portions of 
the document will be stored rather than the entire docu- 
ment and those portions will be represented by their 
statistical characteristics. Further, each type of docu- 
ment, e.g. a dollar bill, will be represented by a number 
of exemplary documents (variety of dollar bills) and in 
every practical position in the scan space (on the 
platen). Also, the geometrical relationships of all signifi- 
cant regions of all exemplary documents in all positions 
are calculated from their respective stored coordinates 
in the scan space, as will be described hereinafter, and 
are stored in the significant region coordinate memory 
58. Feature extractor 54, including histogram equalizer 
54A and x-y direction projector 54B, and vector statisti- 
cal analyzer 55 may be embodied in software, hard- 
ware, firmware, ASIC or combinations thereof. 
[0024] The second major aspect of the invention is the 
detection mode, which is illustrated in Figs. 7A and 7B. 
This mode is activated each time a trial document is 
scanned (step S34) by, for example, placing the docu- 
ment 32 on the platen 30 (Fig.2) in any orientation and, 
for example, pressing the COPY or START button on 
the copy machine user-interface panel. A two-dimen- 
sional window search (step S36) is performed by win- 
dow search engine 47 (Fig.2) on the low-resolution 
preview image (e.g. 50 or 90 dots-per-inch, hereafter 
dpi) of the scanned document, which has a sufficient 
resolution for detection and comparison purposes. The 
search window size is the same as the block size for the 
significant regions selected in the registration mode, i.e. 
step S10 in Fig. 4. At each searching step, i.e. as the 
search window moves through the scanned document, 
features are extracted from the searching window by the 
same feature extraction procedure used in the registra- 
tion mode. More specifically, a gray scale image sub- 
block (feature) is selected from the window image block 
(step S38). The feature sub-block undergoes histogram 
equalization at step S40 and x and y directional projec- 
tion at step S42. As discussed previously with reference 
to the registration mode, this results in a feature vector 
Z. This feature vector is compared (step S44) to the sta- 
tistical characteristics (mean vector M and variance vec- 
tor V) of each of the significant regions stored in the 
model document database 54. This comparison is 
made by similarity calculator 60 (Fig. 2) as follows. 
Given a feature vector Z, a similarity score is calculated 
for each significant region, e.g. the j-th significant 
region, by the following equation: 



5 

where Mj = (m^, m i2 , ... it^n) is the mean vector for the 

j-th significant region and y = (v^ y\ ^%n) is ^ e 

variance vector for the j-th significant region. If the simi- 
larity score Sj for the feature vector is larger than a 

10 selected threshold T (step S44), then the feature image 
sub-block is determined to be similar to the significant 
region j and the coordinates of the window from which 
the feature vector Z was extracted is stored in a candi- 
date list 62 (step S46). In addition, the coordinates of 

is the matching significant region are stored with the cor- 
responding entry in the candidate list so that relative 
geometric relationship between pairs of candidates can 
be compared with the relative geometric relationship 
between pairs of corresponding significant regions. It 

20 should be noted that the coordinates of matching signif- 
icant regions need not be explicitly stored with the can- 
didate coordinates as long as the coordinates are linked 
together such as with an key or index into model docu- 
ment database 56. Threshold T will be selected accord- 

25 ing to the particular application of the present invention 
and will depend on its tolerance for error. If a nearly 
exact match is required, then similarity threshold T can 
be set very high so that only highly similar documents 
will be recognized. In a less critical application, simiiar- 

30 ity threshold T can be set lower to permit recognition of 
less similar documents. Comparison of the extracted 
feature vector Z with the statistical characteristics of the 
significant regions continues until all significant regions 
have been checked (step S48). The process then 

35 repeats for the next extracted feature vector Z until all 
features have been extracted from the current search 
window (step S50). The process then repeats until the 
window search for the entire document has been com- 
pleted (S52). 

40 [0025] Once the window search is completed, the 
geometrical relationships of the image blocks in the 
candidate list are compared to the geometrical relation- 
ships of the corresponding significant region blocks 
(Fig. 7B). The geometrical relationship between at least 

45 two candidates in the candidate list are determined at 
step S54 by relative geometry analyzer 64. At step S56, 
analyzer 64 determines the geometrical relationship 
between the at least two corresponding significant 
regions. These geometrical relationships are compared 

so at step S58 by geometry comparator 66. Note that since 
the coordinates of all significant regions are pre-stored 
in advance in significant region coordinate memory 58, 
then their geometrical relationship can also be deter- 
mined in advance and stored in memory 58 and simply 

55 accessed by comparator 66 when required. If the rela- 
tionship of the candidates is the same as the relation- 
ship of the corresponding significant region image 
blocks, the trial document is deemed to match the 
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model (S60). If not. a no-match is determined (S62). 
This determination can be used by an action unit 68 of 
the image handling unit 10 to perform some predefined 
function. In a color copier, for example, a match could 
be used to prevent a copy of the trial document from s 
printing or could cause a pre-set pattern to be printed 
such as an all black image or a printed warning against 
unauthorized copying. In a vending machine, for exam- 
ple, a match would enable merchandise to be dis- 
pensed. 10 
[0026] Determination of the geometrical relationships 
can be based on the distance between coordinate 
points, relative angles or combinations thereof. For 
example, Fig. 8 shows a model document 92 having two 
significant regions R1 and R2 with a distance D1 is 
between the regions as determined from their relative 
coordinates by distance calculator 64 A. Remember that 
significant region buffer will include these significant 
regions in the image space locations shown plus in all 
other locations that result from the translatipnal and 20 
rotational affine transformations of this original image. 
Trial document 94 has two candidate window blocks W1 
and W2 with a distance DW1 between them, as deter- 
mined from their relative coordinates. In this example, if 
the distance D1 equals DW1, the trial document 94 is 25 
deemed to match the model document 92. If not, a no- 
match is deemed. Fig. 9 shows another example with 
model document 92 having three significant regions R1 . 
R2 and R3 and distances D1 , D2, again as determined 
by distance calculator 64A, and relative angle 0 30 
between them, as determined from their relative coordi- 
nates by angle calculator 64B. Trial document 94 has 
three candidate window blocks W1, W2 and W3 with 
distances DW1 t DW2 and angle 0 W formed between 
them. If D1 = DW1, D2 = DW2, and 0 = 6 W then a match 35 
is deemed. Selection of the number of significant 
regions, and number of calculated distances and angles 
for comparison will be dependent on the application of 
the present invention, the nature of the documents and 
tolerance for error in the comparisons. Also, the dis- 40 
tances and angles will be judged to be the same or 
equal if they are substantially the same, i.e. within a 
degree of error, e.g. ± 10%. The degree of acceptable 
difference or error will be dependent on the application 
of the present invention. As will be readily appreciated 45 
by one skilled in the art of the present invention, relative 
geometry analyzer 64, including distance calculator 
64A and angle calculator 64B, and geometry compara- 
tor 66 can be embodied in software, hardware, 
firmware, ASIC, or combinations thereof. so 
[0027] Image processing unit 14 may further include, 
for example, a central processing unit (CPU) 70, memo- 
ries including a random-access-memory (RAM) 72, 
read-only memory (ROM) 74 and temporary register 
sets 76, and an input/output controller 78, all connected ss 
to an internal bus 80. Although for the sake of illustration 
each of the above units are shown separately within 
image processing unit 14, these functional units may 



form part or all of the various functional units previously 
described such as image buffer 46, significant region 
buffer 52, significant region extractor 48, relative geom- 
etry analyzer 64, etc. Further, depending on the nature 
of unit 10, e.g. a copier or a scanner/printer, the func- 
tional units may be part of a general purpose computer 
programmed to control the scanning and printing mech- 
anisms. Additionally, it will be appreciated that these 
functional units may be irrplemented with discrete com- 
ponents, application specific integrated circuits, proces- 
sors executing appropriate software and the like or any 
combination thereof. 

[0028] Operating system software and/or application 
specific software for operating device 10 and the func- 
tional units may be stored in any combination of the 
memories 72, 74 and 76 or may be stored externally in 
one or more of the I/O units including hard disc drive 
unit 82, diskette drive unit 84, and compact disc drive 
86, each connected to I/O Bus 92. Software for operat- 
ing the device 10 and/or for implementing the method of 
the present invention may be stored on a medium such 
as hard disc 82A, diskette 84A or compact disc 86A, or 
may be stored at a remote device 88 and input through 
communications interface 90. Further, the model 
images need not be scanned into device 10 but may be 
loaded in as previously stored images from one of the 
mediums 82A, 84A, 86a or remote device 88. However, 
the scan space and coordinate system of device taking 
the model images and the scan space and the coordi- 
nate system of the device taking images of the trial doc- 
uments being tested must be the same or, if different, 
such differences must be accommodated for with, for 
example, a scanning space translator such as a look-up 
table. 

[0029] While the invention has been described in con- 
junction with several specific embodiments, it is evident 
to those skilled in the art that many further alternatives, 
modifications and variations will be apparent in light of 
the foregoing description. Thus, the invention described 
herein is intended to embrace all such alternatives, 
modifications, applications and variations as may fall 
within the scope of the appended claims. 

Claims 

1. An apparatus for recognizing patterns in a trial 
image comprising: 

a feature extractor for extracting features from 
window image blocks in said trial image, said 
window image blocks having respective coordi- 
nates in a coordinate space; 
a memory for storing characteristics of regions 
of a model image, said regions having respec- 
tive coordinates in said coordinate space; 
a similarity comparator for comparing said 
extracted features to said characteristics of 
said regions and for indicating features similar 
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to said regions; 

a candidate list for storing coordinates of win- 
dow image blocks with features indicated by 
said comparator as having similar regions and 
for storing coordinates of corresponding similar s 
regions; 

an analyzer for determining a window image 
block geometric relationship between at least 
two window image blocks in said candidate list 
from said respective coordinates and for deter- io 
mining a region geometric relationship 
between at least two corresponding regions 
from said respective coordinates; and 
a geometry comparator for comparing said win- 
dow image block geometric relationship to said is 
region geometric relationship for recognizing 
said trial image as corresponding to said model 
image when said window image block geomet- 
ric relationship is the same as said region geo- 
metric relationship. 20 

2. An apparatus as in Claim 1 wherein said feature 
extractor comprises a histogram equalizer for redis- 
tributing pixel gray levels in an area of said trial 
image. ? 5 

3. An apparatus as in Claim 2 wherein said feature 
extractor comprises an x-y direction projector for 
forming a feature vector from said trial image area 
with redistributed pixel gray levels. 30 

4. An apparatus as in any one of the preceding claims, 
further comprising a scanner for scanning in a 
model image and an image buffer for storing a 
scanned-in model image. 35 

5. An apparatus as in Claim 4 further comprising a 
region extractor for extracting region image blocks 
from said model image stored in said image buffer 
and a region buffer for storing said extracted region 40 
image blocks. 

6. An apparatus as in Claim 5 comprising an affine 
transformer for translating and rotating said region 
image blocks to create additional region image 45 
blocks for storage in said region buffer. 

7. An apparatus as in Claim 6 wherein said feature 
extractor is coupled to said region buffer for extract- 
ing region features from said model image. so 

8. An apparatus as in Claim 7 wherein said feature 
extractor comprises a histogram equalizer for redis- 
tributing pixel gray levels in an area of said model 
image. 55 

9. An apparatus as in Claim 8 wherein said feature 
extractor comprises an x-y direction projector for 



forming a feature vector from said model image 
area with redistributed pixel gray levels. 

10. An apparatus as in Claim 9 further comprising a 
vector statistical analyzer for forming statistical vec- 
tors characterizing regions of said model image. 

1 1. An apparatus as in any one of the preceding claims, 
wherein said analyzer comprises a distance calcu- 
lator for calculating a distance between said at least 
two window image blocks and a distance between 
said at least two corresponding regions. 

12. An apparatus as in Claim 1 1 wherein said analyzer 
comprises an angle calculator for calculating an 
angle between said at least two window image 
blocks and an angle between said at least two cor- 
responding regions. 

13. A method for recognizing patterns in a trial image 
comprising: 

extracting features from window image blocks 
in said trial image, said window image blocks 
having respective coordinates in a coordinate 
space; 

storing characteristics of regions of a model 
image, said regions having respective coordi- 
nates in said coordinate space; 
comparing said extracted features to said char- 
acteristics of said regions and indicating fea- 
tures similar to said regions in a similarity 
comparator; 

storing coordinates of window image blocks 
with features indicated by said comparator as 
having similar regions and storing coordinates 
of corresponding similar regions; 
determining a window image block geometric 
relationship between at least two window 
image blocks from said respective coordinates 
and determining a region geometric relation- 
ship between at least two corresponding 
regions from said respective coordinates; and 
comparing said window image block geometric 
relationship to said region geometric relation- 
ship and recognizing said trial image as corre- 
sponding to said model image when said 
window image block geometric relationship is 
the same as said region geometric relationship. 

14. A method as in Claim 13 wherein said feature 
extracting step comprises histogram equalizing an 
extracted feature for redistributing pixel gray levels 
in an area of said trial image. 

15. A method as in Claim 14 wherein said feature 
extracting step comprises x-y direction projecting 
said redistributed pixel gray levels of said trial 
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image area and forming a feature vector. 

16. A method as in any one of the preceding claims 13 
to 15, further comprising scanning in a model 
image and storing said scanned-in model image in s 
an image buffer. 

1 7. A method as in Claim 1 6 further comprising extract- 
ing region image blocks from said model image 
stored in said image buffer and storing said 10 
extracted region image blocks in a region buffer. 

18. A method as in Claim 1 7 comprising translating and 
rotating said region image blocks to create addi- 
tional region image blocks for storage in said region is 
buffer. 

19. A method as in Claim 18 comprising extracting 
region features from said model image. 

20 

20. A method as in Claim 19 wherein said region fea- 
ture extracting step comprises histogram equalizing 
an extracted region feature to redistribute pixel gray 
levels in an area of said model image. 

25 

21 . A method as in Claim 20 wherein said region fea- 
ture extracting step comprises x-y direction project- 
ing said histogram equalized region feature and 
forming a feature vector from said model image 
area with redistributed pixel gray levels. 30 

22. A method as in Claim 21 further comprising statisti- 
cally analyzing feature vectors and forming statisti- 
cal vectors characterizing regions of said model 
image. 35 

23. A method as in any one of the preceding claims 13 
to 22, wherein said determining step comprises cal- 
culating a distance between said at least two win- 
dow image blocks and a distance between said at 40 
least two corresponding regions. 

24. A method as in Claim 23 wherein said determining 
step comprises calculating an angle between said 

at least two window image blocks and an angle 45 
between said at least two corresponding regions. 

25. A medium readable by a machine embodying a pro- 
gram of instructions executable by said machine to 
perform a method of recognizing patterns in a trial so 
image, said pattern recognizing method being in 
accordance with any one of the preceding claims 

13 to 24. 

26. A system for comparing a trial document to a model ss 
document and for preventing copying of a trial doc- 
ument matching said model document comprising: 



a scanner for scanning a model document and 
a trial document; 

a feature extractor for extracting trial document 
features from window image blocks in said trial 
document and for determining characteristics 
of regions of said model document, said win- 
dow image blocks having respective coordi- 
nates in a scan coordinate space; 
a memory for storing said characteristics of 
regions of said model document, said regions 
having respective coordinates in said scan 
coordinate space; 

a similarity comparator for comparing said 
extracted features to said characteristics of 
said regions and for indicating features similar 
to said regions; 

a candidate list for storing coordinates of win- 
dow image blocks with features indicated by 
said comparator as having similar regions and 
for storing coordinates of corresponding similar 
regions; 

an analyzer for determining a window image 
block geometric relationship between at least 
two window image blocks in said candidate list 
from said respective coorcfl nates and for deter- 
mining a region geometric relationship 
between at least two corresponding regions 
from said respective coordinates; and 
a geometry comparator for comparing said win- 
dow image block geometric relationship to said 
region geometric relationship for recognizing 
said trial document as matching said model 
document when said window image block geo- 
metric relationship is the same as said region 
geometric relationship; 
a printer; 

an action unit coupled to said geometry com- 
parator and said printer for preventing copying 
of said trial document if it matches said model 
document. 

27. A system for comparing a trial document to a model 
document and for preventing copying of a trial doc- 
ument matching said model document comprising: 

an input device for inputting an image of said 
model document and an image of said trial 
document; 

a feature extractor for extracting trial document 
features from window image blocks in said trial 
document and for determining characteristics 
of regions of said model document, said win- 
dow image blocks having respective coordi- 
nates in an input device coordinate space; 
a memory for storing said characteristics of 
regions of said model document, said regions 
having respective coordinates in said input 
device coordinate space; 



9 



BNSDOCID: <EP 09321 1 5A2_I_> 



17 EP 0 932 115 A2 18 



a similarity comparator for comparing said 
extracted features to said characteristics of 
said regions and for indicating features similar 
to said regions; 

a candidate list for storing coordinates of win- s 
dow image blocks with features indicated by 
said comparator as having similar regions and 
for storing coordinates of corresponding similar 
regions; 

an analyzer for determining a window image 10 
block geometric relationship between at least 
two window image blocks in said candidate list 
from said respective coordinates and for deter- 
mining a region geometric relationship 
between at least two corresponding regions is 
from said respective coordinates; and 
a geometry comparator for comparing said win- 
dow image block geometric relationship to said 
region geometric relationship for recognizing 
said trial document as matching said model 20 
document when said window image block geo- 
metric relationship is the same as said region 
geometric relationship; 
an output device; 

an action unit coupled to said geometry com- 25 
parator and said ouput device for preventing 
copying of said trial document if it matches said 
model document. 

28. A system as in Claim 27 wherein said input device 30 
is a digital camera, a computer or a scanner, and 
said output device is a printer. 
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